Ballast: Blast post-processing based on locally conserved segments
نویسندگان
چکیده
MOTIVATION Blast programs are very efficient in finding relatively strong similarities but some very distantly related sequences are given a very high Expect value and are ranked very low in Blast results. We have developed Ballast, a program to predict local maximum segments (LMSs-i.e. sequence segments conserved relatively to their flanking regions) from a single Blast database search and to highlight these divergent homologues. The TBlastN database searches can also be processed with the help of information from a joint BlastP search. RESULTS We have applied the Ballast algorithm to BlastP searches performed with sequences belonging to well described dispersed families (aminoacyl-tRNA synthetases; helicases) against the SwissProt 38 database. We show that Ballast is able to build an appropriate conservation profile and that LMSs are predicted that are consistent with the signatures and motifs described in the literature. Furthermore, by comparing the Blast, PsiBlast and Ballast results obtained on a well defined database of structurally related sequences, we show that the LMSs provide a scoring scheme that can concentrate on top ranking distant homologues better than Blast. Using the graphical user interface available on the Web, specific LMSs may be selected to detect divergent homologues sharing the corresponding properties with the query sequence without requiring any additional database search.
منابع مشابه
کاربری پروتیینهای جدید در ساخت واکسن استافیلوکوکوس اورئوس
Background: Staphylococcus aureus and Staphylococcus epidermidis are major human pathogens of increasing importance due to the spread of antibiotic resistance. Novel potential targets for therapeutic antibodies are products of staphylococcal genes expressed during human infection. Previously, the secreted and surface-exposed proteins among seroreactive antigens have been discovered. Furthermore...
متن کاملHigh precision detection of conserved segments from synteny blocks
A conserved segment, i.e. a segment of chromosome unbroken during evolution, is an important operational concept in comparative genomics. Until now, algorithms that are designed to identify conserved segments often return synteny blocks that overlap, synteny blocks that include micro-rearrangements or synteny blocks erroneously short. Here we present definitions of conserved segments and synten...
متن کاملDbClustal: rapid and reliable global multiple alignments of protein sequences detected by database searches.
DbClustal addresses the important problem of the automatic multiple alignment of the top scoring full-length sequences detected by a database homology search. By combining the advantages of both local and global alignment algorithms into a single system, DbClustal is able to provide accurate global alignments of highly divergent, complex sequence sets. Local alignment information is incorporate...
متن کاملField Imaging Based Assessment of In-Service Ballast Condition Authors
Ballast degradation can cause poor drainage, settlement and reduced lateral stability, which adversely affect railroad track safety and performance. Two indices are commonly used in the North American railroad industry to quantify the level of ballast degradation. These are (i) Fouling Index (FI) and (ii) Percentage Fouling (PF). Identifying these two traditional indices involve ballast samplin...
متن کاملBEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results.
BEAUTY (BLAST enhanced alignment utility) is an enhanced version of the NCBI's BLAST data base search tool that facilitates identification of the functions of matched sequences. We have created new data bases of conserved regions and functional domains for protein sequences in NCBI's Entrez data base, and BEAUTY allows this information to be incorporated directly into BLAST search results. A Co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 16 9 شماره
صفحات -
تاریخ انتشار 2000